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Water-Soluble Thioester and Selenoester Compounds and 
Methods for Making and Using the Same 



Cross-Reference to Related Applications 

5 

This application claims the benefit of U.S. Provisional Application Serial No. 
60/437,285, filed December 30, 2002, which application is incorporated herein by 
reference in its entirety. 

10 Introduction 

Field of the Invention 

The invention relates to thioester and selenoester compounds, their 
manufacture and use, and more particularly peptide thioesters and peptide 
selenoesters, their generators, synthesis, and use. 



Background of the Invention 



Thioesters and selenoesters represent an important class of molecules that 
readily react with nucleophiles. Thioesters are particularly useful for conjugation 
and chemoselective ligation reactions. Chemical ligation involves the 

20 chemoselective covalent linkage of a first chemical component to a second 
chemical component. Unique, mutually reactive functional groups present on the 
first and second components can be used to render the ligation reaction 
chemoselective. For example, thioesters are commonly used to direct the 
chemoselective chemical ligation of peptides and polypeptides. Several different 

25 thioester-mediated chemistries have been utilized for this purpose, such as native 
chemical ligation (Dawson, et al., Science (1994) 266:776-779; Kent, eta I., WO 
96/34878; Kent, et al., WO 98/28434), and involve the use of peptide thioesters and 
peptide selenoesters. 

Synthesis of peptide thioesters and selenoesters is typically carried out using 

30 the principals of solid phase organic chemistry (Hackeng, et al., PNAS (1999) 96: 
10068-10073; and Schnolzer, et al., Int. J. Pept. Prot. Res., (1992) 40:180-193). 
Solid phase peptide synthesis (SPPS) involves the stepwise assembly of amino 
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acids on a solid support. Peptides are "grown" on the support through successive 
cycles of coupling and deprotection of incoming amino acids, followed by cleavage 
of the assembled peptide from the support. The initial cleavage product is 
considered to be "crude" as it contains the desired peptide along with partial 
5 sequences, side reactants, residual organics and the like. So subsequent 
purification of the desired product from the crude cleavage product is typically 
required. Unfortunately, some peptide sequences can be difficult to make and 
purify. 

Solubility is one factor that influences the synthesis and subsequent 

10 handling properties of the peptide. In addition, even though on-resin chain 
assembly and cleavage may proceed efficiently, the crude cleavage product of a 
"difficult" peptide once released from the support can be poorly soluble in aqueous 
or mixed aqueous-organic solutions and may form aggregates. Such peptides 
appear as a smear of products when the crude product is examined by various 

15 analytical techniques such as High Performance Liquid Chromatography (HPLC). 
Thus, the desired product can be difficult to separate from the crude cleavage 
product, resulting in lower purity and lower overall purification yields. 

Synthesis of difficult peptide thioesters and selenoesters represents 
additional challenges since care must be taken to avoid nucleophiles during 

20 synthesis, purification and during handling of the peptides. Moreover, for those 
peptide thioesters and selenoesters that exhibit poor handling properties, such as 
low solubility or aggregation in aqueous solution, the rate, purity and overall yield of 
a given peptide ligation reaction can be diminished. 

Accordingly, it is of interest to develop approaches that overcome at least 

25 some of the above-described problems with respect to solubility and aggregation 
of crude and final peptide products. Of particular interest would be the 
development of such approaches for peptide thioesters or selenoesters. The 
present invention satisfies these needs, as well as others. 

30 Summary of the Invention 

The invention is directed to water-soluble thioester and selenoester 
compounds, generators thereof, as well as methods for making and using the 
same. The subject thioester and selenoester compounds are characterized by 
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including ah amino acid synthon joined to a water-soluble polymer through a 
thioester or a selenoester. Solid phase polypeptide synthesis (SPPS) resins and 
protocols for generating the subject compounds are also provided. The subject 
water-soluble thioester and selenoester compounds find use in a variety of 
5 different applications, including thioester or selenoester mediated chemical ligation 
reactions. 

Brief Description of the Figures 

The invention will be more fully understood by reference to the following 
10 drawings, which are for illustrative purposes only. 

Figure 1 depicts four general compositions in accordance with the 
invention. 

Figure 2 depicts an exemplary use of a water-soluble thioester or selenol 
ester compounds of the invention in a method of native chemical ligation. 
15 Figures 3A-3B depict a Reversed-Phase High Performance Liquid 

Chromatography (HPLC) chromatogram illustrating purities of a crude cleavage 
product containing an exemplary peptide-thioester devoid of a water-soluble 
polymer (FIG- 3A), compared the same peptide-thioester that includes a water 
soluble polymer (FIG. 3B). 

20 

Before the invention is further described in detail, it is to be understood that 
this invention is not limited to particular embodiments described, and as such may, 
of course, vary. It is also to be understood that the terminology used herein is for 
the purpose of describing particular embodiments only, and is not intended to be 
25 limiting, since the scope of the present invention will be limited only by the 
appended claims. Also, as used herein and in the appended claims, the singular 
forms "a", "and", and "the" include plural referents unless the context clearly 
dictates otherwise. 

Where a range of values is provided, it is understood that each intervening 
30 value, to the tenth of the unit of the lower limit unless the context clearly dictates 
otherwise, between the upper and lower limit of that range and any other stated or 
intervening value in that stated range is encompassed within the invention. The 
upper and lower limits of these smaller ranges may independently be included in 
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the smaller ranges is also encompassed , -within the invention, subject to any 
specifically excluded limit in the stated range. Where the-stated range includes 
one or both of the limits, ranges excluding either both of those included limits are 
also included in the invention. 
5 Unless defined otherwise, all technical and scientific terms used herein 

have the same meaning as commonly understood by one of ordinary skill in the art 
to which this invention belongs. Although any methods and materials similar or 
equivalent to those described herein can also be used in the practice or testing of 
the present invention, the preferred methods and materials are now described. All 
10 publications mentioned herein are incorporated herein by reference to disclose and 
describe the methods and/or materials in connection with which the publications 
are cited. 

The publications discussed herein are provided solely for their disclosure 
prior to the filing date of the present application. Nothing herein is to be construed 
15 as an admission that the present invention is not entitled to antedate such 
publication by virtue of prior invention. Further, the dates of publication provided 
may be different from the actual publication dates which may need to be 
independently confirmed. 

20 Description of the Specific Embodiments 

The invention is directed to water-soluble thioester and selenoester 
compounds, generators thereof, and related methods for their production and use. 
The subject thioester and selenoester compounds of the invention are 
characterized by including an amino acid synthon that is joined to a water-soluble 

25 polymer through a thioester or a selenoester. The subject thioester and 
selenoester compounds can be readily made using thioester and selenoester 
generators of the invention. 

The thioester and selenoester generators of the invention comprise an 
amino acid synthon having an N-terminal group joined to a C-terminal group 

30 through an organic backbone, where the C-terminal group is joined to a water- 
soluble polymer through a thioester or selenoester moiety, and where the amino 
acid synthon is joined "to a support through a linker that is cleavable under non- 
nucleophilic conditions. Cleavage of the linker under non-nucleophilic conditions 
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generates the desired water-soluble 5 thioester and selenoester compounds free of 
the support. 

Such water-soluble thioester and selenoester compounds and their 
generators find a broad range of use in a variety of different applications, including 
5 wide applicability in organic synthesis. For instance, water-soluble thioester and 
selenoester compounds of the invention that comprise peptides or polypeptides 
are particularly useful in peptide and polypeptide synthesis techniques that employ 
thioester- and/or selenoester-mediated ligation, including native chemical ligation. 
The invention offers many advantages over the prior art in that the solid 
10 phase synthesis resins and protocols of the invention yield water-soluble thioester 
and selenoester compounds, which are easily purified and/or manipulated. The 
compounds of the invention also can improve the overall efficiency and yields of 
complex multi-step ligation or conjugation schemes. 

In further describing the subject invention, representative thioester and 
15 selenoester compounds are described first in greater detail, followed by a 
description of representative compositions and methods for making and using the 
subject compounds. 

Water-soluble Thioester and Selenoester Compounds 

20 The water-soluble thioester and selenoester compounds of the invention 

include an amino acid synthon having an N-terminal group joined to a C-terminal 
group through an organic backbone, where the C-terminal group is joined to. a 
water-soluble polymer through a thioester or selenoester. The thioester or 
selenoester may or may not be sterically hindered. The organic backbone may 

25 comprise a target molecule of interest, such as an amino acid, peptide, polypeptide 
or other organic compound of interest. 

In certain embodiments, the water-soluble thioester or selenoester 
compounds bear an N-terminal group having a moiety selected from: (i) a 
functional g roup p rotected with a p rotecting g roup, ( ii) a n unprotected functional 

30 group, or (iii) an unprotected group that is substantially unreactive. A preferred N- 
terminal group comprises a moiety selected from a free amine, an amine protected 
with a nucleophile-stable amine protecting group, and an unprotected group 
lacking a reactive functionality, such as a unreactive alkyl or aryl capping moiety 
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that may be linear, branched, substituted or unsubstituted. Another preferred 
embodiment employs an N-terminal group that is capable of supporting chemical 
ligation, and may be protected or unprotected. 

In certain embodiments, the water-soluble thioester or selenoester 
5 compounds of the invention possess a C-terminal group bonded to a thioester or 
selenoester that includes a water-soluble polymer, where the water-soluble 
polymer component will preferably include a repeat unit comprising a polyalkylene 
oxide, a polyamide alkylene oxide, or derivatives thereof. Most preferably, the 
polyalkylene oxide and polyamide alkylene oxide include an ethylene oxide repeat 

10 unit of the formula -(CH2-CH2-O)-. The water-soluble polymer may be linear or 
branched, including dendrimer structures. In a preferred embodiment, the water- 
soluble polymer will have a net charge under physiological conditions. Depending 
on the construct employed, the net charge can be positive, neutral or negative. 
The most preferred water-soluble polymers of the invention are mono-disperse, 

15 i.e., a single molecular species as distinguished from hetero-disperse compounds 
composed of multiple different molecular species. Utilization of a mono-disperse 
water-soluble polymer has the advantage of permitting the construction of 
compounds that are also mono-disperse. Most preferably, the water-soluble 
polymer will have a discrete molecular weight that is sufficient to permit separation 

20 of the compound from a corresponding thioester or selenoester compound that is 
missing the water-soluble polymer. In other embodiments, the C-terminal group 
includes a moiety chosen from a sterically hindered thioester or sterically hindered 
selenoester, where the sterically hindered thioester or selenoester includes a 
water-soluble polymer. Preferably, the water-soluble polymer will include a repeat 

25 unit c omprising a p olyalkylene o xide, a p olyamide a Ikylene o xide, o r d eriyatives 
thereof. Here again, it is most preferable that the water-soluble polymer has a 
discrete molecular weight that is sufficient to provide for separation from a 
corresponding sterically hindered thioester or sterically hindered selenoester 
compound that is missing the water-soluble polymer. 

30 By "amino acid synthon" is intended to refer to a structural unit within a 

molecule, the structural unit including at least one amino acid or amino acid 
residue having an N-terminus comprising or extending from the alpha nitrogen of 
the amino acid or amino acid residue, a C-terminus comprising or extending from 
the alpha carbonyl of the amino acid or amino acid residue and an organic 
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backbone that joins the N- and C-termini and is substituted or unsubstituted with 
one or more side chains, where the structural unit can be formed and/or 
assembled 'by known or conceivable synthetic operations. 

Examples of amino acid synthons are unprotected and partially or fully 
5 protected amino acids and peptides having a modified or unmodified alpha amino 
terminus (N-terminus) and/or a modified or unmodified alpha carbonyl terminus (C- 
terminus), including un-activated and activated esters thereof, as well as salts 
thereof, such as trifluoroacetic acid (TFA) salts. It also includes variable forms 
thereof in which the pendant N- and/or C-termini comprise terminal groups other 

10 than an alpha amino or carbonyl moiety, such as other amino acid non-functional 
and functional groups, one or more protecting groups, halogens, azides, 
conjugates, organic moieties other than an amino acid, a target molecule of 
interest or components thereof, depending on the intended end use. 

The term "amino acid" means any of the 20 genetically encodable amino 

15 acids, non-encoded amino acids, and analogs and derivatives thereof, including a- 
amino acids, p-amino acids, y-amino acids, and other compounds having at least 
one N-terminal amino functionality and at least one C-terminal carboxyl (or 
carbonyl) functionality thereon. L- and D-forms of the chiral amino acids are also 
contemplated. The terms "peptide", "polypeptide" and "protein", which may be 

20 used interchangeably herein, refer to an oligomeric or polymeric form of amino 
acids, which can include coded and non-coded amino acids, chemically or 
biochemically modified or derivatized amino acids and polypeptides having 
modified peptide backbones. 

In the context of an amino acid synthon, an "organic backbone" may 

25 comprise the alpha, beta and/or gamma carbons of a single amino acid residue, 
and other substituents, including additional backbone carbons and/or heteroatoms, 
as well as alpha amino groups of an amino acid or residue that are substituted or 
unsubstituted (amides included), alpha carbonyls that are substituted or 
unsubstituted (carboxyls, carboxyesters and amide bonds included), and may 

30 comprise an amino acid residue or peptide, as well as organic side chains. 
Representative organic side chains are those of amino acids. The organic 
backbone typically comprises a part of or most of a target molecule of interest. 

The organic backbone may be fully protected, partially protected or 
unprotected depending on the intended end use. This includes organic backbones 
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that have one or more side chains bearing a functional group protected with a 
protecting group, as well as different protecting groups thereon that are removable 
orthogonal conditions. Such a configuration is particularly convenient where the 
organic backbone is constructed using solution or solid phase chain assembly 
5 methods. For instance, the organic backbone may include a peptide chain 
containing amino acid residues bearing protected functional groups removable 
under conditions orthogonal to removal of an N-terminal protecting group during 
peptide elongation cycles. In other instances, the organic backbone may contain 
one or more side chains bearing a functional group protected with a protecting 

10 group that is removable under the same conditions as the N-terminal protecting 
group. Alternatively, the organic backbone may contain one or more side chain 
functional groups that are substantially non-reactive to conditions used for 
generating or manipulating a target molecule during elongation or elaboration 
cycles, and/or side chains that would otherwise be reactive but are protected with 

15 protecting groups that are orthogonal to such generating or manipulating 
conditions. 

The term "orthogonal" as used herein with respect to protecting groups, 
linkers and other groups means that the specific group or linker is removable or 
cleavable under conditions that do not result in removal or cleavage of an 

20 "orthogonal" group or I inker. Thus, for example, where one protecting group is 
nucleophile-stable and another is n ucleophile-labile, such groups a re said to be 
"orthogonal". For instance, when the organic backbone is made to contain 
cysteine amino acid residues, the side chain thiol can be protected with an 
acetamidomethyl (Acm) or Picolyl group, which are stable to basic conditions (e.g., 

25 typical conditions for Fmoc-compatible cycles) or acidic conditions (e.g., typical 
Boc-compatible cycles). Protecting groups like Acm- and Picolyl also are 
removable under conditions orthogonal to carbonyl protecting groups such as Allyl 
or ODMab, as well as to primary amine protecting groups such as Alloc. Where 
the organic backbone contains side chain functional groups that are substantially 

30 unreactive, protection of those groups is typically not required. Examples of side 
chain groups that are substantially unreactive include saturated alkyl groups, 
chains and alcohols, and other such groups can be selected depending on the 
conditions employed. 
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As described above, the thioester and selenoester compounds of the 
invention may have a modified or unmodified alpha amino terminus (N-terminus). 
In a preferred embodiment, the thioester and selenoester compounds of the 
invention have an N-terminal group that comprises an amino acid. Any amino acid 
5 can be used. I n certain embodiments, the amino acid is capable of supporting 
chemical ligation. 

Chemical ligation involves the selective covalent linkage of a first chemical 
component to a second chemical component. Orthogonally reactive functional 
groups present on the first and second components can be used to render the 

10 ligation reaction chemoselective. For example, chemical ligation of peptides and 
polypeptides involves the chemoselective reaction of peptide or polypeptide 
segments bearing compatible, mutually reactive C-terminal and N-terminal amino 
acids. Several different chemistries have been utilized for this purpose, examples 
of which include native chemical ligation (Dawson, et al., Science (1994) 266:776- 

15 779; Kent, et al., WO 96/34878; Kent et al., US 6,184,344), extended general 
chemical ligation (Kent, et al., WO 98/28434; and Kent et al., US 6,307,018); 
extended native chemical ligation (Botti et al., WO 02/20557); oxime-forming 
chemical ligation (Rose, et al.. J. Amer. Chem. Soc. (1994) 116:30-33), thioester 
forming ligation (Schnolzer, et al., Science (1992) 256:221-225), thioether forming 

20 ligation (Englebretsen, et al., Tet. Letts. (1995) 36(48):8871-8874), hydrazone 
forming ligation (Gaertner, et al., Bioconj. Chem. (1994) 5(4):333-338), and 
thiazolidine forming ligation and oxazolidine forming ligation (Zhang, et al., Proc. 
Natl. Acad. Sci. (1998) 95(1 6):91 84-91 89; Tarn, et al., WO 95/00846) or by other 
methods (Yan, L.Z. and Dawson, P.E., "Synthesis of Peptides and Proteins without 

25 Cysteine Residues by Native Chemical Ligation Combined with Desulfurization," J. 
Am. Chem. Soc. (2001) 123:526-533; Gieselnan et al., Org. Lett. (2001) 3(9): 1331- 
1334; Saxon, E. et al., "Traceless" Staudinger Ligation for the Chemoselective 
Synthesis of Amide Bonds. Org. Lett. (2000) 2:2141-2143). In many embodiments, 
chemical ligation methods of interest employ amide-forming chemical ligation, such 

30 as native chemical ligation and extended native chemical ligation. 

By "capable of supporting chemical ligation" is intended to refer to a moiety 
that is in a form which can be directly employed in a chemical ligation reaction, or 
can be converted to a moiety for use in a chemical ligation reaction. In many 
situations, a moiety capable of supporting chemical ligation will be in a form that 
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must be converted for a ligation reaction to proceed. For instance, when a water- 
soluble thioester or selenoester compound of the invention is employed for making 
a target molecule bearing an N-terminal amino acid capable of supporting 
chemical ligation in combination with a C-terminal thioester or selenoester having a 
5 water-soluble polymer, the N-terminal amino acid is typically protected by a 
method to avoid intramolecular cyclization unless such cyclization is desired. By 
protecting the N-terminal amino acid in this way, such a target molecule can be 
used for a t hioester or s elenoester-mediated c hemical I igation reaction, s uch a s 
native or extended native chemical ligation, followed by removal of the N-terminal 

10 protection for subsequent native or extended native chemical ligation reaction 
cycles (e.g., sequential native or extended native chemical ligation). In some 
instances, however, intramolecular cyclization may be desired, which is particularly 
useful for making cyclic products, such as cyclic peptides. N-terminal amino acids, 
such as serines, that are capable of being converted to bear an aldehyde moiety 

15 by mild oxidation or reductive alkylation is another example, and are particularly 
useful in Schiff-base mediated chemical ligation reactions. In other chemical 
ligation reactions, the N-terminal amino acid can be provided in a ready-to-use 
chemical ligation form, such as when the N-terminal amino acid bears an azide, 
halogen or aminooxy group for other chemical ligation reactions. 

20 Where the N-terminal group comprises an amino acid capable of supporting 

native or extended native chemical ligation, the amino acid comprises a side chain 
bearing an atom selected from sulfur and selenium. Examples of amino acids 
suitable for use in native chemical ligation include an alpha-carbon side chain 
bearing a sulfur or selenium atom, such as cysteine, homocysteine, 

25 selenocysteine, homoselenocysteine and respective protected forms thereof. 
Examples of amino acids suitable for use in extended native chemical ligation 
comprise an alpha-nitrogen side chain bearing a sulfur or selenium atom, which 
include the alpha-nitrogen substituted 2 or 3 carbon chain alkyl or aryl thiol and 
selenol a uxiliaries, a nd p rotected forms t hereof a s d escribed i n B otti eta I., WO 

30 02/20557. As can be appreciated, an N-terminal amino acid capable of supporting 
native or extended native chemical ligation can be protected using a protecting 
group for the alpha-nitrogen, the side chain sulfur or selenium, or a combination of 
both, including cyclic protection strategies employing an N-terminal thioproline or 
extended native chemical ligation alpha-nitrogen substituted auxiliary. The 
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thioester and selenoester generators of the invention preferably employ an amino 
acid bearing a side chain sulfur or selenium group that is protected. 

As described above, the C-terminal group of the thioester and selenoester 
compounds of the invention includes an ester of sulfur or selenium, i.e., thioester 
5 and selenoester, at the G-terminal end of the amino acid synthon. The ester is 
further characterized in that it includes a water-soluble polymer bonded, either 
directly or indirectly through a linker or spacer, to the sulfur or selenium atom of the 
ester. The ester moiety may include any linker or spacer group compatible with 
the thioester or selenoester group, including, but not limited to, aryl, benzyl, and 

10 alkyl groups that may be linear, branched, substituted or unsubstituted, and which 
includes amino acid, peptide and other organic thioester or selenoester moieties. 
Representative examples include 3-carboxy-4-nitrophenyl thioesters, benzyl 
thioesters and selenoesters, mercaptopropionyl thioesters and selenoipropionyl 
selenoesters, and mercaptopropionic acid leucine thioesters and selenolpropionic 

15 acid leucine selenoesters (See, e.g., Dawson et al., Science (1994) 266:776-779; 
Canne et al. Tetrahedron Lett. (1995) 36:1217-1220; Kent, et al., WO 96/34878; 
Kent, et al., WO 98/28434; Ingenito et al., J. Am. Chem. Soc (1999) 
121 (49):1 1369-1 1374; and Hackeng et al., Proc. Natl. Acad. Sci. U.S.A. (1999) 
96:10068-10073), that are linked or derivatized and joined to a water-soluble 

20 polymer of the invention directly or indirectly though a spacer. 

In certain embodiments, the C-terminal group comprises the formula 
J-CH(R 2 )-C(0)-X-R 3 , where J comprises a residue of the organic backbone; R 2 
comprises any side chain group; X is sulfur or selenium; and R 3 is any thioester or 
selenoester compatible group that includes a water-soluble polymer group. In this 

25 formula, R2 may be any organic side chain compatible with thioesters or 
selenoester, such as a side chain of an amino acid, and R 3 may be selected from a 
group of the formula -C(R 7 )(R 8 )-U-Polymer, where R 7 and R 8 each individually are 
selected from hydrogen, linear, branched, substituted and unsubstituted alkyl, aryl, 
heteroaryl and benzyl groups, and where U is a linker or spacer that may be 

30 present or absent, and "Polymer" is a water-soluble polymer. The U group may 
include linear or branched moieties comprising one or more repeat units employed 
in the water-soluble polymer, natural or unnatural amino acids or derivatives 
thereof, as well as aliphatic moieties, including alkyl, aryl, heteroalkyl, heteroaryl, 
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alkoxy and the like, which preferably contain up to 18 carbon atoms or even an 
additional polymer chain. 

The formula J-CH(R 2 )-C(0)-X-R3 also includes C-terminal groups that 
comprise a sterically hindered thioester or selenoester, where R 2 and/or Rj may be 
5 a g roup t hat s terically h inders t he thioester o r s elenoester m oiety - C(0)-X-. B y 
"sterically hindering" or "sterically hindered" is intended to refer to a group or 
groups that prevent or help prevent hydrolysis or self-induced, aminolysis 
associated with the -C(0)-X- moiety. Accordingly, in this embodiment, R 2 may be 
selected from a branching group having the formula -C(R4)(Rs)(R6), where R4, R5 

10 and Re each individually are selected from hydrogen and linear, branched, 
substituted and unsubstituted alkyl, aryl, heteroaryl and benzyl groups, with the 
proviso that when R 3 is devoid of a sterically hindering group, then two or more of 
R4, R 5 a nd R 6 are s elected from I inear, branched, s ubstituted a nd u nsubstituted 
alkyl, aryl, heteroaryl and benzyl groups, and R 3 may be selected from a group of 

1 5 the formula -C(R 7 )(R 8 )-U-Polymer, where R 7 and R 8 each individually are selected 
from hydrogen and linear, branched, substituted and unsubstituted alkyl, aryl, 
heteroaryl and benzyl groups, with the proviso that when R 2 is devoid of a 
sterically hindering group, then one or more of R7 and Rs are selected from linear, 
branched, substituted and unsubstituted alkyl, aryl, heteroaryl and benzyl groups 

20 for sterically hindered thioester or sterically hindered selenoester, and where U 
and Polymer are as described above. 

As described above, a feature of the thioester and selenoester moieties of 
the subject compounds is the presence of a water-soluble polymer. In other 
words, the thioester or selenoester moieties of the subject compounds include a 

25 water-soluble polymer moiety. Typically, the water-soluble polymer moiety is 
bonded either directly or through a linking or spacer group to the sulfur or selenium 
atom of the ester moiety. By "water-soluble polymer" is intended to mean a 
polymer that is soluble in water and has an atomic molecular weight greater than 
about 500 Daltons. The water-soluble polymer may have an effective 

30 hydrodynamic molecular weight of greater than 1000 Da, with 1,000 to about 
500,000 Da, and more preferably, from about 1,000 to about 100,000 Da, and 
most preferably, from about 1 ,500 to about 50,000 Da. By "effective hydrodynamic 
molecular weight" is intended to refer to the effective water-solvated size of a 
polymer chain a s determined by aqueous-based size exclusion chromatography 
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(SEC). When the water-soluble polymer contains polymer chains having 
polyalkylene oxide repeat units, such as ethylene oxide repeat units, each chain 
may have an atomic molecular weight of between about 200 and about 80,000 Da, 
such as between about 500 and about 40,000 Da, with 500 to about 5,000 Da 
5 being the size range in certain embodiments. Unless referred to specifically, 
molecular weight is intended to refer to atomic molecular weight 

The water-soluble polymer component can have a wide range of molecular 
weight, and polymer subunits. These subunits may include a biological polymer, a 
synthetic polymer, or a combination thereof. Examples of such water-soluble 

10 polymers include: dextran and dextran derivatives, including dextran sulfate, P- 
amino cross linked dextrin, and carboxymethyl dextrin, cellulose and cellulose 
derivatives, including methylcellulose and carboxymethyl cellulose, starch and 
dextrines, and derivatives and hydroylactes of starch, polyalklyene glycol and 
derivatives thereof, including polyethylene glycol, methoxypolyethylene glycol, 

15 polyethylene glycol homopolymers, polypropylene glycol homopolymers, 
copolymers of ethylene glycol with propylene glycol, wherein said homopolymers 
and copolymers are unsubstituted or substituted at one end with an alkyl group, 
heparin and fragments of heparin, polyvinyl alcohol and polyvinyl ethyl ethers, 
polyvinylpyrrolidone, aspartamide, and polyoxyethylated polyols, with the dextran 

20 and dextran derivatives, dextrine and dextrine derivatives. It will be appreciated 
that various derivatives of the specifically recited water-soluble polymers are also 
contemplated. 

Water-soluble polymers such as those described above are well known, 
particularly the polyalkylene oxide based polymers such as polyethylene glycol 

25 "PEG" (See. e.g., "Poly(ethylene glycol) Chemistry: Biotechnical a nd Biomedical 
Applications", J.M. Harris, Ed., Plenum Press, New York, NY (1992); and 
"Poly(ethylene glycol) Chemistry and Biological Applications", J.M. Harris and S. 
Zalipsky, Eds., ACS (1997); and International Patent Applications: WO 90/13540, 
WO 92/00748, WO 92/16555, WO 94/04193,WO 94/14758, WO 94/17039, WO 

30 94/18247, WO 94/28937, WO 95/11924, WO 96/00080, WO 96/23794, WO 
98/07713, WO 98/41562, WO 98/48837, WO 99/30727, WO 99/32134, WO 
99/33483, WO 99/53951, WO 01/26692, WO 95/13312, WO 96/21469, WO 
97/03106, WO 99/45964, and US Patents Nos. 4,179,337; 5;075,046; 5,089,261; 
5,100,992; 5,134,192; 5,166,309; 5,171,264; 5,213,891; 5,219,564; 5,275,838; 
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5,281,698; 5,298,643; 5,312,808; 5,321,095; 5,324,844; 5,349,001; 5,352,756; 
5,405,877; 5,455027; 5,446,090; 5,470,829; 5,478,805; 5,567,422; 5,605,976; 
5,612,460; 5,614549; 5,618,528; 5,672,662; 5,637,749; 5,643,575; 5,650,388; 
5,681,567; 5,686,110; 5,730,990; 5,739,208; 5,756,593; 5,808,096; 5,824, 778; 
5 5,824,784; 5,840,900; 5,874,500; 5,880,131; 5,900,461; 5,902,588; 5,919,442; 
5,919,455; 5,932,462; 5,965,119; 5,965,566; 5,985,263; 5,990,237; 6,011,042; 
6,013,283; 6,077, 939; 6,113,906; 6,127355; 6,177,087; 6,180,095; 6,194,580; 
6,214,966). 

In certain embodiments, the water-soluble polymer component comprises a 

10 polyalkylene oxide, polyamide alkylene oxide, or derivatives thereof. A more 
typical polyalkylene oxide and polyamide alkylene oxide comprise an ethylene 
oxide repeat unit of the formula -(CH 2 -CH 2 -0)-. A representative ■ polymer 
component is a polyamide having a molecular weight greater than about 500 
Daltons of the formula -[C(0)-c|>-C(0)-NH-\|/-NH]n5- or -[NH-Yi/-NH-C(OH>-C(0)]n 5 -, 

15 where <j> and \\f are divalent radicals that may be the same or different and may be 
branched or linear, and n 5 is a discrete integer from 2 to 100, and more preferably 
from 2 to 50, and where either or both of <j> and iy comprises a biocompatible, 
substantially non-antigenic water-soluble repeat unit that may be linear or 
branched. In certain embodiments, the water-soluble repeat unit comprises an 

20 ethylene oxide of the formula -(CH 2 -CH 2 -0)- or -(0-CH 2 -CH 2 )-. The number of 
such water-soluble repeat units can vary significantly, but the more preferred 
number of such units is from 2 to 500, 2 to 400, 2 to 300, 2 to 200, 2 to 100 and 
sometimes 2 to 50. An example of such an embodiment is where one or both of <}> 
and v is selected from: -((CH 2 )n6-(CH 2 -CH 2 -0) n 7-(CH 2 )n 6 ->- or -((CH 2 )„6-(0-CH 2 - 

25 CH 2 )n7-(CH 2 ) n6 -)-, where n 6 is 1 to 6, 1 to 5, 1 to 4 and most preferably 1 to 3, and 
where n 7 is 2 to 50, 2 to 25, 2 to 15, 2 to 10, 2 to 8 and most preferably 2 to 5. An 
example of a highly preferred embodiment is where <J> is -(CH 2 -CH 2 )-, and where \|/ 
is -(CH 2 -(CH 2 -CH 2 -0) 3 -CH 2 -CH 2 -CH 2 )- or -(CH2-CH 2 -CH 2 -(0-CH2-CH 2 ) 3 -CH 2 )-. 
Preferred water-soluble polymers and exemplary synthesis approaches, including 

30 those of the formula -[C(0)-<|>-C(0)-NH-i}/-NH] n 5- or -[NH-xi/-NH-C(0)-<j)-C(0)] n5 - 
are described in PCT Publication Nos. WO 02/19963 and WO 02/20033. 

The more preferred water-soluble polymer is one produced in total by 
stepwise synthesis. This permits construction of polymers having a precise 
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molecuter- -weight and defined structure. In contrast, normal polymer synthesis, 
which is a polymerization process, results in a mixture in which chains are of 
differing lengths. Thus, there is a distribution of molecular weights and sizes that 
are difficult if not impossible to separate. The ability to control molecular purity is 
5 advantageous in that thioester or selenoester compounds of the invention can be 
constructed to include a water-soluble polymer that is mono-disperse, i.e., a single 
molecular species as distinguished from hetero-disperse compounds composed of 
multiple different molecular species. This represents a significant advantage in 
that variable properties resulting from heterogeneous compounds can be avoided, 

10 including contamination with side-reaction byproducts, and thus, only those 
thioester or selenoester compounds with the most preferred properties such as 
high purity can be prepared and isolated with relative ease. 

As noted above, water-soluble polymers that are made in total by stepwise 
assembly can be made as mono-disperse, for example, the preferred polyamide 

15 ethylene oxides of the invention, such as those of the formula -[C(0)-<j>-C(0)-NH- 
M/-NH]n5- or -[NH-\i/-NH-C(0)-<f>-C(0)]n5-- Such polyamide polymers combine the 
advantages of stepwise chain assembly along with the precision length, flexibility 
and water-solubility properties (sometimes referred to as "pPEG", "precise length 
polymer" or "PLP"). Such PLP moieties can be synthesized in any of a variety of 

20 ways. Such moieties are, however, preferably produced using a solid phase 
stepwise chain assembly of units. The use of such an assembly process permits 
the moieties of a preparation to have a defined and homogeneous structure, as to 
their length, the nature of their \\r and <|> substituents, the pos'rtion(s) (if any) of 
branch points, and the length, \\i and <|> substituents, and position(s) of any 

25 branches. Methods of particular interest for synthesis of such PLP moieties is 
described in, for example, PCT Publication Nos. WO 02/19963, WO 02/20033, and 
WO 00/12587 (which references refer to the compound -[NH-\|/-NH-C(0)-<|>-C(0)]- 
as -[NH-Y-NH-C(0)-X-C(0)]-). 

Thus, another preferred embodiment is one where the water-soluble 

30 polymer of the water-soluble thioester and selenoester compounds of the invention 
is mono-disperse (i.e., a molecularly homogenous composition containing a single 
and structurally defined molecular species of interest). Also, it is preferred that the 
target molecule of interest comprising an amino acid synthon be monodisperse as 
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well, for example, as peptides and proteinscan be made in total by chemical 
synthesis, they can be made mono-disperse as well. Such compounds have the 
advantage of being highly pure and avoid the problems of purification and 
analytical characterization such as when hetero-disperse polymers are employed. 

5 Such compounds are advantageous in terms of reproducible synthesis and the like 
as well, for example, a single molecular species as opposed to mixtures typical of 
polymers made by polymerization processes, e.g., polyethylene glycol (PEG). 

Accordingly, properties of the water-soluble thioesters and selenoesters of 
the invention can be modified by precisely adjusting the molecular weight, 

10 structure, hydrophilicity, hydrophobicity and charge of the water-soluble polymer. 
In particular, in addition to increasing the water hydration and molecular weight of 
a target compound of interest to improve its synthesis and handling properties, the 
water-soluble polymer of the thioester or selenoester can render the target 
. compound to have a precise molecular weight, structure, hydrophilicity, 

1 5 hydrophobicity and charge that can be adjusted for a given solution or solid phase 
system to modulate the handling properties and/or subsequent manipulation and 
properties of a target compound of interest. A preferred embodiment of the 
invention i s t hus d irected to w ater-soluble. t hioester a nd s elenoester c ompounds 
and methods that combine the above features. 

20 By way of example, a preferred water-soluble thioester and selenoester 

compound of the present invention comprises an amino acid synthon having an N- 
terminal group joined to a C-terminal group through an organic backbone, and 
comprises the formula: 



Ri Q R 

H I II H I 

Y N CH — (CH 2 )n., — C--N CH (CH 2 )n 2 — C- 

25 0jn3 0) 



wherein Y is a target molecule of interest that may be present or absent; Ri 
is hydrogen or an organic side chain; each R individually is hydrogen or an organic 
side-chain; ni and n 2 each are from 0 to 2; n 3 is from 0 to 100; X is sulfur or 
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selenium; and R 3 is any group compatible with thioesters or selenoesters that 
includes a water-soluble polymer, as described above. 

In compounds of structure (1), the group Y may comprise any molecule of 
interest including, for example, an amino acid, peptide, polypeptide, nucleic acid, 
5 lipid, carbohydrate, combinations thereof and the like. In many embodiments, Y 
groups are peptides. In a preferred embodiment, Y comprises an N-terminal group 
that may be protected or unprotected. Where the N-terminal group of the target 
molecule Y is designed as an intermediate, for subsequent chemical ligation 
reactions, Y will preferably be composed of an N-terminal amino acid that is 

10 capable of supporting chemical ligation. Examples of N-terminal amino acids 
capable of supporting chemical ligation include cysteine, or an N-alpha amino 
substituted with an auxiliary side chain bearing a thiol or selenol for general or 
extended native chemical ligation. For N-terminal ligation groups, the thiols, 
selenols or other nucleophiles may protected with protecting groups such as Acm 

15 or benzyl derivatives. Alternatively, the N-alpha amino group of may be protected 
(e.g., Boc-, Fmoc, Nsc etc.). In some instances, both the N-alpha amino and the 
side chain functionalities are protected, e.g., by forming a cleavable cyclic structure 
such as a thioproline in the case of cysteine. Where Y comprises an N-terminal 
group that is substantially non-reactive, such as a linear, branched, substituted or 

20 unsubstituted aliphatic or other capping group, then no N-terminal protecting group 
is needed. Alternatively, a reactive group may be present on the N-terminal group, 
but is generally chosen so as not to react with the C-terminal thioester or 
selenoester, except where thioester- or selenoester-mediated intramolecular 
cyclization of the compound is desired. As can be appreciated, other protecting 

25 groups may be present on the compound for subsequent elaborations, such as 
one or more residual protecting groups on the side chains R and Ri depending on 
the intended end use of the thioester or selenoester compound. Such protecting 
groups, if present can be cleavable under identical or orthogonal conditions 
relative to other protecting groups that may be present. 

30 The R and Ri groups of the structure (1) may each individually comprise 

hydrogen or any organic side chain, for example, a side chain of an amino acid. 
The group R3 may comprise any group that is compatible with a thioester or 
selenoester, where the group includes a water-soluble polymer as described 
above. As noted above, n 3 in structure (1) is from 0 to 100, and is preferably 0 to 
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100 when Y is absent, or when Y is present is typically less than 100 depending on 
the nature of Y. For example, when Y is a peptide, then the total length of the 
thioester or selenoester compound will typically be about or less than 100 amino 
acids. 

In another embodiment, and by way of example, a sterically hindered water- 
soluble thioester or selenoester compound comprising an amino acid synthon 
having an N-terminal group joined to a C-terminal group through an organic 
backbone, comprises the formula: 



10 



Ri O 

H I II 

N CH — (CH 2 )n., — C" 



H 
N- 



R 
I 

-CH- 



-(CH 2 )n 2 — C- 

II 

O 



H 

N- 



? 2 

-CH— C- 
II 

o 



-X R, 



n 4 



(2) 



wherein Y is a target molecule of interest that may be present or absent; 
each Ri, R and R2 individually is any side chain group and may be the same or 
different; ni and n 2 each individually is 0, 1 or 2; n 3 is 0 to 100; n 4 is 0 to 1; X is 

15 sulfur or selenium; and R 3 is any thioester or selenoester compatible group that 
includes a water-soluble polymer; and wherein at least one of R, Ri, R 2 and R 3 
includes a group that sterically hinders the thioester or selenoester moiety -C(O)- 
X-. For example, when n 2 and n 4 are 0, R and/or R 3 would include a sterically 
hindering group. In another example, when n^ n 3 and n 4 are 0, R and/or R 3 would 

20 include a sterically hindering group. Yet another example is where n 2 is 1 or 2, 
and n 3 is one or more, then R 2 and/or R 3 would include a sterically hindering 
group. 

In the compounds of the structure (2), n 3 and the Y group are the same as 
described above for compounds of the structure (1). Similarly, the group R, and 

25 R 2 groups of the structure (2) may each individually comprise hydrogen or any 
organic side chain, for example, a side chain of an amino acid, with the proviso 
that at least one of R, Ri, R 2 and R 3 includes a group that sterically hinders the 
thioester or selenoester moiety -C(0)-X- as described above. The group R 3 may 
comprise any group that is compatible with a thioester or selenoester, where the 

30 group includes a water-soluble polymer as described above, again with the proviso 
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that at least one of R, R 1t R 2 and R 3 includes a group that sterically hinders the 
thioester or selenoester moiety -C(0)-X- as described above. For instance, 
sterically hindering groups usable for R, Ri and R 2 include, by way of example, 
branched alkane, cycloalkane, alkyl-substituted aryl and heteroaryl groups, and 
5 combinations thereof. Such sterically hindering groups may comprise the formula 
-C(R4)(R 5 )(R6) as described above, where R4, R5 and R6 each individually includes 
hydrogen, a linear, branched, cyclic substituted or unsubstituted alky!, aryl, 
heteroaryl or benzyl group, and at least two of R4, R5 and R 6 each individually 
include a linear, branched, cyclic substituted or unsubstituted alkyl, aryl, heteroaryl 

10 or benzyl group. 

Similarly, the R 3 group of structure (2) may be selected from a group of the 
formula -C(R 7 )(R 8 )-U-Polymer, where R 7 and Ra each individually are selected 
from hydrogen and linear, branched, substituted and unsubstituted alkyl, aryl, 
heteroaryl and benzyl groups, with the proviso that when R, R 1f and R 2 are devoid 

15 of a sterically hindering group, then one or more of R 7 and Rq are selected from 
linear, branched, substituted and unsubstituted alkyl, aryl, heteroaryl and benzyl 
groups for sterically hindered thioester or sterically hindered selenoester, and 
where U is a linker or spacer that may be present or absent, and "Polymer" is a 
water-soluble polymer. The U group may include linear or branched moieties 

20 comprising one or more repeat units employed in the water-soluble polymer, 
natural or unnatural amino acids or derivatives thereof, as well as aliphatic 
moieties, including alkyl, aryl, heteroalkyl, heteroaryl, alkoxy and the like, which 
preferably contain up to 18 carbon atoms or even an additional polymer chain. 
The formula -C(R 7 )(R8)-U-Polymer also may be represented as the formula - 

25 C(R 7 )(R8)-Polymer, where U is missing or provided by the component Polymer. 
Other groups providing steric hindrance for the thioester or selenoester moiety 
may also be used. 

Synthesis of the Subject Water-soluble Thioester and Selenoester Compounds, 
30 and Generator Thereof 

The subject thioester and selenoester compounds of the invention may be 
synthesized, in general terms, by: (a) providing a thioester or selenoester 
generator comprising a support that is joined through a linker to an amino acid 
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-synthon* where the linker is cieavable under non-nucleophilic conditions, and 
where the amino acid synthon comprises a thioester or selenoester joined to a 
water-soluble polymer; and (b) cleaving the linker under non-nucleophilic 
conditions to release the amino acid synthon from the support. 
5 The amino acid synthon, when joined to the support, may comprise a target 

molecule of interest that is lacking one or more reactive functional groups. As 
used herein, the term "lacking reactive functional groups" is intended to mean a 
group or radical in which such reaGtive functional group is entirely absent, as well 
as a group or radical that contains a protected functional group that would 

10 otherwise be reactive but for the presence of the protecting group. For example, 
where the support-bound amino acid synthon and its organic backbone includes 
side chains or other moieties bearing functional groups (e.g., functional groups of a 
target peptide), one or more, and preferably all of such groups will typically be 
protected when a target molecule is synthesized on the support (e.g., SPPS). 

15 Depending on the choice of protecting groups, the cleavage reaction may provide 
. for their removal. Alternatively, one or more of the protecting groups may be 
retained if they are removable under conditions that differ from the cleavage 
reaction, i.e., orthogonal protection. 

Accordingly, a thioester or selenoester product of the above method of the 

20 invention may be fully, partially or totally unprotected following cleavage and 
release from the support, and be soluble under aqueous conditions. The organic 
backbone of the amino acid synthon may be associated with a target molecule and 
may comprise an amino acid, peptide or polypeptide with one or more side chains 
bearing protected or unprotected functional groups, and the N -terminal group of 

25 the amino acid synthon may itself comprise protected or unprotected amino acid 
groups as also described above. The N-terminal group also may be capable of 
supporting chemical ligation, and may comprise an amino acid with a protected or 
unprotected side chain functionality capable of participating in native chemical 
ligation, extended chemical ligation or other ligation technique to form an amide or 

30 other bond. 

Cleavage of the linker to form the freed thioester or selenoester compound 
may be carried out under various conditions according to the nature of the linker 
used a nd t he o rthogonality of p rotecting g roups p resent i n the composition with 
respect to the linker. Where an N-terminal protecting group is present, cleavage of 

20 
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the linker may be carried out under conditions orthogonal to removal of the N- 
terminal protecting group, as well as orthogonal to any protecting groups for side 
chain groups associated with the amino acid synthon, such that the freed thioester 
or selenoester compound is fully or partially protected. Such orthogonal conditions 
5 may include, for example, linker cleavage under acid conditions where the N- 
terminal protecting group is acid stable. Linker cleavage may alternatively involve 
non-orthogonal conditions that also result in removal of the N-terminal protecting 
group and/or one or more amino acid side chain protecting groups that may be 
present on the organic backbone, such that the freed thioester or selenoester 

10 compound is partially protected or unprotected all together. Selection of various 
protecting groups and orthogonality of removal of protecting groups with respect to 
linker cleavage may be made based on desired synthetic schemes and solubility 
characteristics for the freed thioester or selenoester compounds. 

In the subject thioester or selenoester generators of the invention, the linker 

15 may be joined to the amino acid synthon at any one of several positions. For 
instance, the linker may be joined to the backbone (e.g., backbone heteroatom 
such as nitrogen) or side chain (e.g., pendant functional group of a side chain such 
as an amino or carboxylate group) of the amino acid synthon. Alternatively, the 
linker may be joined to the water-soluble polymer of the amino acid synthon. 

20 Moreover, the linker may comprise any cleavable group capable of anchoring the 
amino acid synthon to the support material, with the proviso that the linker is 
cleavable under non-nucleophilic conditions. Cleavage under non-nucleophilic 
conditions is important and desirable as nucleophiles can destroy the thioester or 
selenoester. 

25 The use of linker groups in solid phase synthesis is well known, and various 

linker groups are usable with the invention. For instance, the linker may be 
bifunctional, and may serve as a spacer with a cleavable group on one end, and a 
group such as a carboxyl g roup at the other end that can be activated to allow 
coupling to a functionalized support material. The linker can be a preformed linker 

30 4 or may be prepared on a support material. 

Suitable linkers include, for example, PAL, XAL, PAM, RINK, SCAL and 
Sieber-based linker systems (e.g., PAL (5-(4'-aminomethyl-3\5'- 
dimethoxyphenoxy)valeric acid, XAL (5-(9-aminoxanthen-2-oxy)valeric acid), 4- 
(alpha-aminobenzyl)phenoxyacetic acid, 4~(alpha-amino-4'- 
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methoxybenzyl)phenoxybutyric acid, p-alkoxybenzyl (PAB) linkers, photolabile o- 
nitrobenzyl ester linkers, • - , 4-(alpha-amino-4'-methoxybenzyl)-2- 
methylphenoxyacetic acid, 2-hydroxyethyisulfonylacetic acid, 2-(4- 
carboxyphenylsulfonyl)ethanol, (S^'-aminomethyl-S'^'-dimethoxyphenoxyJvaleric 
5 acid) linkers, WANG hydroxymethyl phenoxy-based linkers, RINK 
trialkoxybenzydrol and trial koxybenzhydra mine linkers, PAM 
phenylacetamidomethyl, SCAL-type safety catch acid labile linkers and Sieber 
aminoxanthenyl linkers). Derivatives of such linkers and other linker systems may 
also be used. T hese I inker s ystems are cleavable under well known acidolysis 
10 conditions (typically trifluoroacetic acid (TFA) or hydrogen fluoride (HF)), UV 
photolysis (A« 350 nm) conditions or catalytic hydrogenation conditions. Several of 

i 

the above linker systems are commercially available as pre-formed on resin and 
glass supports. 

In the subject generators, the linker is covalently anchored to a support as 

15 described further below. Suitable supports may comprise, for example, matrices, 
surfaces, resins or other solid phase supports that are compatible with peptide 
synthesis or other synthetic schemes associated with a target molecule. The 
support may comprise a functionalized glass, an organic polymer or other 
compatible material. As such, the support of the thioester and selenoester 

20 generators of the invention comprises a solid phase, matrix or surface compatible 
with organic synthesis strategies. Preferable supports are those compatible with 
peptide synthesis. A variety of such supports are well known, and can be 
employed, including those described in further detail herein. Examples include . 
supports or resins comprising cross-linked polymers, such as divinylbezene cross- 

25 linked polystyrene polymers, or other organic polymers that find use for solid 
phase organic or peptide synthesis. Controlled porous glass (CPG) supports are 
another example. In general, the most preferred supports are stable and possess 
good swelling characteristics in many organic solvents. 

The above and other suitable solid supports, linkers, their preparation and 

30 application are described in, for example, "Advanced Chemtech Handbook of 
Combinatorial & Solid Phase Organic Chemistry," W.D. Bennet, J.W. Christensen, 
L.K. Hamaker, M.L Peterson, M.R. Rhodes, and H.H. Saneii, Eds., Advanced 
Chemtech, 1998, and elsewhere (See, e.g., G.B. Fields et ah, Synthetic Peptides: 
A User's Guide, 77-183, G.A. Grant, Ed., W.H. Freeman and Co., New York, 1990; 
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NovaBiochem Catalog, 2000; "Synthetic Peptides, A User's Guide," G.A. Grant, 
Ed., W.H. Freeman & Company, New York, NY, 1992; "Principles of Peptide 
Synthesis, 2nd ed.," M. Bodanszky, Ed., Springer-Verlag, 1993; "The Practice of 
Peptide Synthesis, 2nd ed.," M. Bodanszky and A. Bodanszky, Eds., Springer- 
5 Verlag, 1994; "Fmoc Solid Phase Peptide Synthesis, A Practical Approach," W.C. 
Chan and P.D. White, Eds., Oxford Press, 2000). 

By way of example, preferred thioester or selenoester generators of the 
invention comprise the formula: 



— X R 3 Lj Support 

n 3 



wherein PG is a protecting group that maybe present or absent, Y is a 
target molecule of interest and may be present or absent, and when Y is absent 

15 PG is an amino protecting group that may be present or absent; Ri and each R 
individually is hydrogen or an organic side chain; R 3 is a water-soluble polymer; X 
is sulfur or selenium; ni and n 2 each are from 0 to 2; n 3 is from 0 to 100; each L 1s 
L 2 and L 3 is a linker cleavable under non-nucleophilic conditions; and Support is a 
solid phase, matrix or surface; and wherein only one of U, L 2 , and L 3 is present. 

20 Compositions of formula (3) also embody thioester and selenoester 

generators for production of the sterically hindered water-soluble thioesters and 
selenoesters of the invention. Such generators comprise the formula: 



Support 



PG- 



i' t 

-N CH — (CH 2 )n 1 — C- 
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N CH- 



-(CH 2 )n 2 — C- 
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Support 



PG Y— N CH— (CH 2 )rv, — C-|-N CH— (CH 2 )n 2 —C- 

O 



Jn 3 



Support 



II 



R 2 

H S 

N — CH— C-f-X R 3 — Support 



n 4 



(4) 



wherein PG, Y, R, R 1f R 3> X, n 1f n 2 , n 3 , L if L 2 , L 3 and Support are the same 
5 as described above for the structure (3); and where n 4 is from 0 to 1 ; and R 2 is 
hydrogen or an organic side chain; and wherein only one of L if L 2> and L 3 is 
present. 

As shown in formulas (3) and (4), the amino acid synthon comprising the 
water-soluble thioester or selenoester compound may be joined to the support 

10 through any one several positions, i.e., at a backbone nitrogen (L 3 ), side chain (L 2 ) 
or to the water-soluble polymer (Li). Hence, only one Li, L 2 , and L 3 is present. 
For example, when linker Li is present the water-soluble polymer group R 3 of the 
amino acid synthon is joined to the support through linker Li (i.e., -R 3 -L r Support), 
whereas the groups L 2 -Support and L 3 -Support are missing. Cleavage from the 

15 support produces compounds embodied in the structures (1) and (2) as described 
above. 



Synthesis of the Water-Soluble Polymer Thioester or Selenoester Generators 

The thioester and selenoester generators employed to produce the water- 
20 soluble thioester and selenoester compounds may be prepared using any 
convenient synthesis protocol. One type of representative and convenient protocol 
for synthesis of the generators is a non-nucleophile based synthesis scheme or 
protocol. Another type of representative and convenient protocol for synthesis of 
the generators is a nucleophile-based synthesis scheme or protocol. Such 
25 schemes, protocols or approaches are particularly useful where the synthesis of a 
target molecule of interest, such as a peptide or polypeptide is prepared by Boc- 
SPPS (non-nucleophilic stratagem) or Fmoc- or Nsc-SPPS (nucleophilic 
stratagem). The method may be employed to make generators for the production 
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of sterically hindered and non-hindered thioesters and selenoesters according to 
the invention. 

For the non-nucleophilic approach, a preferred method is exemplified below 
in Scheme 1 for forming the amino acid synthon structure (5): 

Scheme 1 M — Support 

(R 3 (direct attachment or on-resin . . 

stepwise chain assembly) 

R 3 L| Support 

acyl-X-OSu 



Ri O 

I II 

PG-, — N— CH — (CH 2 )ni — C — X OSu 

H 



acyl-X— R 3 — L 1 — Support 

Ri O 

I II 

PGi — N — CH — (CH 2 )rii — C — OSu 
H 



Ri u 

I II 

PG 1 — N— CH — (CH 2 )n^— C— X R 3 — L| — Support 

H 



(5) 



As shown in Scheme 1, water-soluble group R3 is attached to a linker- 
10 functionalized support to generate R 3 -Li-Support, where R 3 , Li and Support are as 
described above for the structures (3) and (4). An amino acid synthon is coupled 
to the water-soluble polymer via one of two approaches. In one method, an 
activated acylthioester or acylselenoester (denoted acyl-X-OSu) is coupled first, 
wherein the R 3 -Li-Support is coupled to the sulfur or selenium, as denoted by X. 
15 This is followed by removal of the acyl moiety (e.g., piperdine cocktail containing a 
thiol or selenol reducing reagent), and coupling of an activated amino acid that has 
its N-terminal alpha amino group protected with a protecting group PG1 that is 
removable under non-nucleophilic conditions (e.g., a Boc group), where X is sulfur 
or selenium, and where OSu represents an activated ester moiety (e.g., an OPfp 
20 pentafluorophenyl ester). I n another method, a preformed activated ester of an 
amino acid thioester or selenoester (denoted PGi-NH-CH(R 1 )-(CH 2 )n 1 -C(0)-X- 
Osu) is coupled directly to R 3 -Li-Support. Preferably, the activated ester of the 
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amino acid thioester or selenoester is sterically hindered. As depicted, Ri is a side 
chain that may be protected, depending on the side chain, with a group that is 
removable under non-hucleophilic conditions orthogonal to PGi, and where ni is 
from 0 to 2. As can be appreciated, dipeptides or larger peptides can be utilized 
for the process as well. Also, the first amino acid chosen for bearing the thioester 
or selenoester component will typically be selected based on the target molecule 
that one wishes to construct. For example, where a peptide is the target molecule, 
and the amino acid sequence of the desired peptide is NH2-GIy-Gly-Ser-thioester- 
polymer, then a PGi-protected serine is the first amino acid coupled to the 
polymer-modified support. 

The amino acid synthon may also be attached to a support through the 
alpha-nitrogen or the side chain of the structure (5) through either L2-Support or L 3 - 
Support, respectively, as described above. In this situation, on-resin 
thioesterfication or selenoesterfication reactions analogous to those described 
below for the nucleophilic approaches may be employed except that the thioester 
or selenoester may be attached before, during or after a target molecule of interest 
is synthesized, since nucleophiles can be excluded from the reaction. However, 
the above method depicted in Scheme 1 is preferred. 

The structure (5) may be cleaved to generate a water-soluble thioester or 
se | enoes ter suitable for solution syntheses, or further elaborated as shown in 
Scheme 2 below to form the structures (6) and (7), where PGi, R, Ri, R 2 , R3, X, 
ni, n 2 , n 3> n 4 , Li and Support are the same as described above for the structures 
(3), (4) and (5) above. 
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Scheme 2 
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The structures (5), (6) and (7) represent preferred embodiments of the structures 
(3) and (4) described above, and can be elaborated further or cleaved to form a 
thioester or selenoester compound of the invention. 

For the nucleophilic approach, thioesterfication or selenoesterfication 
reactions, that are analogous to those described above, are employed except that 
the thioester or selenoester is attached only after the last nucleophile-based 
manipulation has been performed. This is necessary as nucleophiles can destroy 
the thioester or selenoester component prematurely. A preferred embodiment of 
the non-nucleophilic approach is exemplified below in Schemes 3A, 3B and 3C, 
which initially employ the compositions of the structures (8) and (9). 
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or 
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L 2 -^—Support 
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Referring to structures (8) and (9), PG 2 is a nucleophile-labile protecting 
5 group; Y is a target molecule of interest that may be present or absent; L 2 is a 
nucleophile-stable linker; R1 is a divalent radical lacking reactive functional groups; 
each R and R 2 individually is hydrogen or any organic side chain lacking reactive 
functional groups; n-i and n 2 each are from 0 to 2; n 3 is from 0 to 20; n 4 is 0 to 1; 
PG 3 is any protecting group that is removable under conditions orthogonal to 
10 removal of PG 2 and cleavage of L 2 ; and Support is a solid phase, matrix or 
surface. 

As described above, the preferred support is compatible with solid phase 
organic synthesis (SPOS) or solid phase peptide synthesis (SPPS). The preferred 
nucleophile-stable linkers are removable under acidic conditions as provided by 

15 trifluoroacetic acid (TFA) or anhydrous hydrogen fluoride (HF), under catalytic 
conditions in the presence of H 2 , or by other mechanism such as light (e.g., UV 
photolysis). The amino acid synthon will preferably be composed of an amino acid 
having a side chain anchored to the support through the linker, and may be 
provided in the initial composition as a single amino acid residue, peptide, or an 

20 organic composition containing an amino acid component, peptide or residue 
thereof. As also noted above, the organic backbone is lacking reactive functional 
groups. In most instances, protecting groups, if present on the organic backbone, 
are preferably selected so as to be removable under the same conditions as the 
linker. However, protecting groups can be selected that provide an additional level 

25 of orthogonality when site-specific modifications to the organic backbone are 
desired during or after synthesis. Components that bear functional groups that are 
substantially unreactive under the synthesis conditions may also be unprotected, 
thus rendering the organic backbone as one lacking reactive functional groups. 

The R1 group includes a radical based on an amino acid side chain or 

30 derivative thereof that has a functionality capable of covalently binding to the linker 
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L 2 . For example, preferred Ri groups comprise a side chain of an amino acid 
selected from aspartic acid, glutamic acid, glutamine, lysine, serine, threonine, 
arginine, cysteine, histidine, tryptophan, tyrosine and asparagine. 

The protecting group PG 2 may comprise any of a variety of nucleophile- 
5 labile protecting groups. As noted above, the particular protecting group PG 2 may 
be selected based on the particular molecule of interest or target molecule, 
compatibility with other protecting groups or functionalities that will be present 
during synthesis, or other considerations. The protecting group PG 3 may comprise 
any group capable of protecting a carboxyl group and is orthogonal to the 

10 nucleophile-labile protecting group PG 2 and the nucleophile-stable linker L, as 
discussed above. Exemplary protecting groups PG 3 and PG 2 fitting these criteria 
include allyl and ODmab groups for the C-terminal carboxyl protection, Fmoc and 
Nsc when the N-terminal group is an amine, and where a suitable linker would be 
one cleavable under acidic conditions. For instance, allyl groups are stable to 

15 nucleophiles, yet are removable by palladium-catalyzed hydrogenation. ODmab 
groups can be removed with hydrazine, which is a very strong nucleophile, but are 
stable to typical conditions employed for removal of most other nucleophile-labile 
protecting groups, such as N-terminal amino protecting groups Fmoc and 2-(4- 
nitrophenylsulfonyl)ethoxycarbonyl (Nsc). For instance, Fmoc and Nsc groups are 

20 readily removed by piperidine, which is a much weaker nucleophile compared to 
hydrazine. This difference in stability provides the appropriate level of 
orthogonality. 

Compositions of the structures (8) and (9) are easily extensible using 
conventional Fmoc-based or Nsc-based solid phase organic or peptide synthesis 

25 (i.e., SPOS or SPPS) techniques, and provide for a "side chain"-based anchoring 
during synthesis for elaborating a target molecule of interest Y. For instance, 
structures (8) and (9) can be employed in a variety of nucleophile-based chain 
elongation synthesis schemes involving repeated cycles of nucleophilic 
deprotection and coupling with incoming compounds bearing a reactive moiety and 

30 PG 2 , as illustrated below for structure (8) in Scheme 3A. 
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Scheme 3A 
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The nucleophile-labile protecting group PG 2 is then selectively removed 
under nucleophilic conditions to reveal a deprotected functional reactive group. 
5 For instance, where PG 2 is a nucleophile-labile amino protecting group, and the 
pendant N-terminal group of Y is an amine, PG 2 can be Fmoc or Nsc, and removal 
thereof can be carried out under basic conditions that do not remove PG 3 . The 
deprotected N-terminal reactive functional group is then coupled to a compound of 
interest bearing a single reactive moiety capable of forming a covalent bond 

10 therewith. Various compounds can be employed in this step, depending on the 
intended end use. The compound of interest may include a non-reactive group, 
which requires no further protection, or can be one that includes one or more 
reactive functional groups that need protected. In the latter instance, the 
protecting group is selected to be orthogonal to removal of PG 3 . For example, in 

15 the case of Fmoc-SPPS the compound of interest can be a final Boc-protected 
amino acid where the PG 3 group is allyl or ODmab. The product of this step of the 
process is illustrated below for structure (10) in Scheme 3B, which generates the 
structure (11), where PG 4 is a protecting group on the compound of interest that 
may be present of absent, and when present is removable under conditions 

20 orthogonal to PG 3 . 
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Scheme 3B 
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10 



After coupling of the compound of interest, the C-terminal carboxyl 
protecting group PG 3 is selectively removed to generate a free carboxyl group. 
Conditions for removing the carboxyl protecting group are chosen based on the 
protecting group employed. For instance, where an ally group is employed, 
palladium-catalyzed hydrogenation can be used, or where an ODmab group is 
employed, the appropriate hydrazine cocktail can be used. The free carboxyl 
group is then converted to a thioester or selenoester bearing a water-soluble 
polymer, which is illustrated below in Scheme 3C for structure (11) in the 
generation of structure (12), where X is sulfur or selenium, and R 3 is as described 
above. 
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The structure (12) represents a preferred embodiment of the structures (3) 
and (4) described above, and the same approach may be applied to generate 
sterically hindered thioester and selenoester compounds as well. Moreover, the 
5 thioester or selenoester compound can be elaborated further on-resin or cleaved 
to form a thioester or selenoester compound of the invention. 

As noted above, preferred side-chains are those provided by an amino acid 
such as aspartic acid, glutamic acid, glutamine, lysine, serine, threonine, arginine, 
cysteine, histidine, tryptophan, tyrosine and asparagine, as well as their derivatives 

10 and analogs thereof. For example, glutamic acid can be used in a side chain 
anchoring strategy for generating a target peptide thioester or selenoester via 
Fmoc-SPPS. In this instance, a single glutamic acid can be used as the precursor 
where the side chain acid is coupled to an Fmoc-compatible carboxyl-generating 
linker (e.g., WANG (HMPA), RINK Acid etc.) to a support, an Fmoc group protects 

15 the N-alpha amine, and an allyl group protects the O-alpha carboxyl. The desired 
number of cycles of Fmoc SPPS are carried out as described above, so that a 
protected peptide is "grown" or otherwise formed in the N- to C-terminal direction 
from the N-alpha amine of the amino acid joined to the linker, to provide a 
protected peptide joined or anchored to the linker and support via* the glutamate 

20 side chain. The O-alpha carboxyl allyl protecting group can then be removed from 
the anchored protected peptide under H 2 /palladium catalyst conditions (e.g., 
Pd(Ph 3 )4/PhSiH 3 ) in dichloromethane (DCM). Following removal of the allyl 
protecting group, the O-alpha carboxyl can then be activated (e.g., using HATU (N- 
[(dimethylamino)-l H-1 , 2, 3-triazol [4, 5-b] pyridiylmethylene]-N- 

25 methylmethanaminium hexafluorophosphate N-oxide). The anchored, protected 
peptide with the activated O-alpha carboxyl can then be reacted with a preformed 
amino acid thioester-water soluble polymer or salt thereof (e.g., the trifluoroacetic 
acid (TFA) salt form) to produce a target peptide thioester anchored to the linker 
and support. In this example, the backbone side chains of the peptide are 

30 protected with acid-labile protecting groups. An acid such as TFA may then be 
used to cleave the linker and release the target peptide thioester-water soluble 
polymer from the support and remove the acid-labile side chain protecting groups 
from the target. 
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Another s pacific e xample i llustrating t he s ide c hain a nchoring of an i nitial 
amino acid to a support prior , to formation of a target water-soluble peptide 
thioester-polymer is lysine. For example, a support bound WANG linker can be 
treated with N^'-disiiccinimidyl carbonate (DSC) / 4-dimethylaminopyridine 
5 (DMAP) in N,N-dimethyl formamide (DMF) to activate the linker for coupling. The 
activated linker is then treated with the TFA salt of Na-Fmoc-Oa-allyl lysine, in 
N,N-diisopropylethylamine (DIEA) / DMF, to form a linker with a urethane group 
made with the lysine side chain e-amino group. The lysine residue thus anchored 
by its side chain provides an initial basis for Fmoc-based SPPS synthesis, which 

10 can be carried out to generate a peptide by stepwise growth in the N- to C-terminal 
direction from the N-alpha amine as described above. Once the desired peptide is 
formed with the desired number of cycles of Fmoc-SPPS, the N-alpha amine can 
be protected with a nucleophile-stable protecting group such as a Boc-protected 
amino acid, and then the O-alpha carbonyl deprotected using H 2 /Pd 

15 (Pd(Ph 3 )4/PhSiH 3 ). The free O-alpha carbonyl may then be activated using 7- 
azabenzotroazoI-1-1 yloxtris (pyrrolidine) phosphonium hexafluorophosphate 
(PyAOP) in DIEA / DMF, and reacted with preformed acyl-thioester water-soluble 
polymer to produce an anchored target peptide thioester. The target peptide 
thioester can then be cleaved from the support by treatment with TFA cocktails to 

20 yield the free target water-soluble peptide thioester-polymer. 

The above-described nucleophilic strategy for side-chain anchored amino 
acid synthons can be applied to backbone anchored amino acid synthons as well. 
The main difference here is the use of a linker that is attached to the backbone 
alpha nitrogen of an amino acid synthon for coupling to the support. Several 

25 linkers for this approach are well known and can be used (Barany et aL, J. Org. 
Chem. (1999) 64(24):8761-8760; I shi et aL, B iosci. B iotechnol. B iochem. (2002) 
66(2):225-232). To improve yields in the backbone approach, care should be 
taken to avoid aminolysis in the first on-resin coupling reaction, which can occur 
when the initial coupling employs a single amino acid capable of forming a 5- 

30 member or 6-member cyclic side product via aminolysis of the resin-supported 
amino acid. For example, aminolysis can be avoided by using a dipeptide or larger 
in the first coupling, and/or using a resin that employs an amino acid synthon 
comprising a tri-peptide or larger, or a combination thereof. 
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The organics, equipment, supports, amino acids and diversity components, 
linkers, and protecting groups finding use in the above-described aspects of the 
invention can be obtained from a variety of commercial sources, prepared de novo, 
or a combination thereof. Moreover, the reagents and other materials employed 
5 for the method, as well as alternative components will be apparent to one of 
ordinary skill in the art (See, e.g., "Protecting Groups in Organic Synthesis", 3rd 
Edition, T.W. Greene and P.G.M. Wuts, Eds., John Wiley & Sons Inc., 1999; 
NovaBiochem Catalog 2000; "Synthetic Peptides, A User's Guide," G.A. Grant, Ed., 
W.H. Freeman & Company, New York, NY, 1992; "Advanced Chemtech Handbook 

10 of Combinatorial & Solid Phase Organic Chemistry," W.D. Bennet, J.W. 
Christensen, L.K. Hamaker, M.L Peterson, M R. Rhodes, and H.H. Saneii, Eds., 
Advanced Chemtech, 1998; "Principles of Peptide Synthesis, 2nd ed.," M. 
Bodanszky, Ed., Springer-Verlag, 1993; 'The Practice of Peptide Synthesis, 2nd 
ed.," M . B odanszky a nd A . B odanszky, E ds., S pringer-Verlag, 1 994; " Protecting 

15 Groups," PJ. Kocienski, Ed., Georg Thieme Verlag, Stuttgart, Germany, 1994; 
"Fmoc Solid Phase Peptide Synthesis, A Practical Approach," W.C. Chan and P.D. 
White, Eds., Oxford Press, 2000, and elsewhere). 

Utility 

20 The subject water-soluble selenoester and thioester compounds, (and 

generators thereof in certain embodiments) as described a bove, a re particularly 
useful in peptide and polypeptide synthesis techniques that employ thioester 
and/or selenoester-mediated chemical ligation. Such protocols/applications are 
well known to those of skill in the art, and described in WO 98/56807; WO 

25 02/18417; and WO 02/20557 (the United States Provisional Priority Applications of 
which are herein incorporated by reference) as well as in United States Patent No. 
6,326,468; the disclosure of which is herein incorporated by reference. 

Kits 

30 Given the broad range of use, the subject thioester and selenoester 

generators and compounds also may be provided in kits and the like. As such, 
also provided are kits for use in the subject invention, where such kits may 
comprise containers, each with one or more of the various reagents utilized in the 
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methods, including, for example, -water soluble polymer functionalized solid 
supports, blocked and/or activated monomers, activators, deblockers, buffers, 
solvents and the like 

The kits may further include instructions for using the kit components in the subject 
5 methods. The instructions may be printed on a substrate, such as paper or plastic, 
etc. As such, the instructions may be present in the kits as a package insert, in the 
labeling of the container of the kit or components thereof (i.e., associated with the 
packaging or sub-packaging) etc. In other embodiments, the instructions are 
present as an electronic storage data file present on a suitable computer readable 
10 storage medium, e.g., CD-ROM, diskette, etc. 

Illustrative Exemplary Embodiments 

The above-described embodiments of the invention will be more fully 
understood by reference to additional specific examples, compositions and 

15 reaction schemes shown below and in the Figures. 

Referring now to Figure 1, the structure of (1A) illustrates a general 
thioester or selenoester composition of the invention exemplified by a peptide- 
thioester-polymer, or peptide-selenoester-polymer. I n particular, the structure of 
(1A) depicts a peptide denoted "Peptide" coupled to a water-soluble polymer 

20 denoted "Polymer" through an alpha-amino acid thioester or selenoester, where X 
is sulfur or selenium, and where R is a side chain of an amino acid. Figure 1 also 
shows structures (1B) through (1D) that illustrate three different general 
embodiments in accordance with the invention that include structure (1A) attached 
to a support resin in different configurations and preferred thioester or selenoester 

25 generators for production of the structure (1A). In particular, structure (1C) 
illustrates a water-soluble compound of structure (1 A) anchored to a support via a 
side-chain linker. Structure (1D) illustrates a water-soluble compound of the 
structure (1A) anchored to a support via a backbone nitrogen linker system. 
Structure (1B) illustrates a water-soluble compound of structure (1A) anchored to a 

30 support via a linker attached to the water-soluble polymer. In reference to 
structures (1B) through (1D) of Figure 1, Peptide, X, Polymer, L and Support are 
as described above for structure (1 A). 
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Referring now to Figure 2, a water-soluble peptide thioester-polymer or 
selenoester-polymer (such as in Figure 1 (1 A)) is joined by native chemical ligation 
to another peptide bearing an unprotected N-terminal cysteine. In this example, the 
water-soluble peptide thioester-polymer or selenoester-polymer bears an Acm- 
5 protected N-terminal cysteine. Ligation of the two peptides forms a native amide 
bond at the ligation site and displacement of the thioester-polymer or selenoester- 
polymer component. The Acm of the ligated product may then be removed for 
additional ligation reactions. 

The following specific example illustrates a preferred embodiment in which 

10 generator structure (1B) of Figure 1 is constructed to form a thioester generator of 
the invention, and utilized to produce a peptide-thioester-polymer embodied in 
Figure 1 (1A), in which the polymer component is a water-soluble polyamide 
polymer of discrete length. In particular, a target molecule of interest comprising 
the thioester water-soluble polymer of structure (13) can be generated on an 

15 amide-generating resin such as a NH 2 -Leu-PAM-resin, where Y is a target 
molecule of interest bearing an N-terminal amino group, R' is the side chain of an 
amino acid or derivative thereof, Y is from 1 to 2, and where V is 1 then R 7 and R 8 
are both hydrogen for a non-sterically hindered thioester, and are, for example, - 
CH(CH 3 ) 2 and hydrogen, or -CH 3 and -CH 3f respectively, for a sterically hindered 

20 thioester, and "n" is a discrete integer from 2 to 12. 




Specifically, structure (13) can be produced by coupling a desired water- 
25 soluble polyamide polymer to the NH 2 -Leu-PAM resin via stepwise chain assembly 
or by attachment in its full linear form. This process is illustrated below in 
Schemes 4 and 5 for the stepwise chain assembly of a water-soluble thioester- 
polymer compound. 
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Scheme 4 
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Structure (14), where n is the number of cycles of polymer synthesis, is then 
coupled to a preformed acylthioester (15), produced, for example, as the OPfp 
5 ester as follows: 



o 

A, 
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A 0 ^7 , R 8 
s^V OH 



PfpOH 



DCC 



f (15) 



10 



15 



The acyl of resulting acylthioester composition (16) may then be removed, e.g., 
piperdine cocktail containing B-mercaptoethanol, which is then suitable for coupling 
the active ester of an initial amino acid or other compound, such as an initial Boc- 
protected amino acid as shown in the structure (17), followed by Boc-SPPS to 
generate a resin-bound target molecule of interest, as shown in structure (18). 
Cleavage of (18) in HF yields compound (13). 
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10 



Where the last coupled amino acid of structure (18) prior to cleavage is an Acm- 
protected cysteine, the structure (19) can be produced. Where an internal amino 
acid contains an orthogonally protected side-chain group, e.g., Fmoc-protected 
side chain of lysine (K), and/or post-elongation modifications that form an 
unprotected side-chain functional group that is substantially unreactive under 
cleavage conditions, e.g., aminooxy-functionalized side chain of lysine, various 
combinations of the structure (20) can be produced. 
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H O R 7 Rs 



(19) 



j O ^ S(ACm) R' O 

H 0 R 7 R 8 
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(20) 



The above structures (18), (19) and (20) are but a few of a variety of reprehensive 
examples illustrating the flexibility of the invention and the broad ranges of use of 
15 the subject thioester and selenoester generators and compounds of the invention. 
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All publications and patent applications mentioned in this specification are 
herein incorporated by reference to the same extent as if each individual 
publication or patent application was s pecifically a nd individually indicated to be 
incorporated by reference. 
5 Having now generally described the invention, the same will be more readily 

understood through reference to the additional following examples, which are 
provided by way of illustration, and are not intended to be limiting of the present 
invention, unless specified-^ 

10 Examples 

The following examples are put forth so as to provide those of ordinary skill 
in the art with a complete disclosure and description of how to make and use the 
present i nvention, a nd a re n ot i ntended to I imit t he s cope of w hat t he i nventors 
regard as their invention nor are they intended to represent that the experiments 

15 below are all or the only experiments performed. Efforts have been made to 
ensure accuracy with respect to numbers used but some experimental errors and 
deviations should be accounted for. Unless indicated otherwise, parts are parts by 
weight, molecular weight is weight average molecular weight, temperature is in 
degrees Centigrade, and pressure is at or near atmospheric. 

20 The following experimental examples provide for the synthesis of certain 

peptides using tert-butyloxycarbonyl- (Boc)-compatible synthesis, including Boc- 
based solid-phase peptide synthesis (SPPS). Those skilled in the art will 
recognize, however, that the water-soluble thioester and selenoester generators 
and related methods of the invention may be applied in 9- 

25 fluorenylmethoxycarbonyl-(Fmoc)-compatible synthesis, including Fmoc-based 
SPPS, as well as combinations of Fmoc- and Boc-compatible synthesis. 
Additional embodiments include 2-(4-nitrophenylsulfonyl)ethoxycarbonyl (Nsc), 
allyloxycarbonyl (Alloc), and other protection schemes compatible with SPPS and 
nucleophile-labile thioesters and selenoesters. The experimental examples also 

30 utilize peptide synthesis involving chain extension from an N-terminus. Those 
skilled in the art will recognize that peptide synthesis involving extension from the 
C-terminus may also be carried out using the invention. Thus, it should be 
understood that the invention is not limited to the particular embodiments 
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10 



described below, as variations of these embodiments may be made and still fall 
within the scope of the appended claims. 

Moreover, the following experimental examples provide a detailed 
description of the Boc-based solid phase synthesis of certain specific peptide 
thioester compounds. Those skilled in the art will recognize that the same or 
similar procedures described below may be used to synthesize numerous types of 
thioester and thioester compounds and generators thereof. The selenium based- 
chemistry associated with selenoester formation is well known in the art and, 
where appropriate, may be substituted. Table 1 provides a list or glossary of 
abbreviations used in the following experimental examples. 



Acm 
Alloc 
15 BOP 

Br.CI Z 

DCM 

DDE 

20 DIPCDI 
DIPEA 
DMAP 
DMF 
DMSO 

25 EtOH 
Fmoc 
FM 
HATU 

30 HBTU 



HF 



Table 1 

acetamidomethyl 
allyoxycarbonyl 

benzotriazol-1-yloxytris (dimethylamino) phosphonium 

hexafluorophosphate 

Br.CI Benzylcarbamate 

dichloromethane 

4,4-dimethyl-2,6-dioxocycloex 1 -ylidene 

N, /V-diisopropylcarbodiimide 

N, /V-diisopropylethylamine 

4-dimethylaminopyridine 

A/,/S/-dimethylformamide 

dimethylsulfoxide 

ethanol 

9-fluorenylmethoxycarbonyl 
9-Fluorenyl methyl 

(A/-[(dimethylamino)-1H-1, 2, 3-triazol [4, 5-b] pyridiylmethylene]-A/- 
methylmethanaminium hexafluorophosphate A/-oxide). 
A/-[(1 -H-benzotriazol-1 -yl)(dimethylamine) methylene]-A/- 
methylmethanaminium hexafluorophosphate N-oxide previously 
named 0-(benzotriazol-1-yl)-1 , 1,3, 3-tetramethyluronium 
hexafluorophosphate 
hydrofluoric acid 
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HMP resin 4-hydroxymethylphenoxy resin; palkoxybenzyl alcohol resin; or Wang 



resin 

HOAt 1-hydroxy-7-azabenzotriazole 

HOBt 1-hydroxybenzotriazole 

5 IP10 Interferon-gamma inducible protein 10 kDa 

Mbh dimethoxybenzhydryl 

MBHA resin 4-methylbenzhydrylamine resin 

Meb p-MethylBenzyl 

MMA N-methylmercaptoacetamide 

10 Mmt p-Methoxytriityl 

Mob p-MethoxyBenzyl 

Msc 2-Methylsulfoethylcarbamate 

Msz 4-Methylsulfinylbenzylcarbamate 

Mtr 4-methoxy-2, 3, 6-trimethylbenzene sulfonyl 

15 NMM A/methylmorpholine 

NMP /V-methylprrolidone, A/-methyl-2-pyrrolidone 

Nsc 4-nitrophenylethylsulfonyl-ethyloxycarbonyl 

OPfp pentafluorophenyl ester 

OfBu ferf-butyl ester 

20 PAC peptide acid linker 

PAL peptide amide linker 

Pbf 2, 2, 4, 6, 7-pentamethyldihydrobenzofuran-5-suIfonyl 

PEG-PS polyethylene glycol-polystyrene 

Picolyl methyl-pyridyl 

25 Pmc 2, 2, 4, 6, 8-pentamethylchroman-6-sulfonyl 

PyAOP 7-azabenzotroazol-1-1yloxtris (pyrrolidino) phosphonium 
hexafluorophosphate 

S-fl3u ferf-butyl-thio 

Tacam Trimethylacetamidomethyl 

30 ffioc fe/f-butyloxycarbonyl 

TBTU 0-(benzotriazol-1-yl)-1 ,1,3, 3-tetramethyl uronium tetrafluoroborate 

fBu ferf-butyl 

TFA trifluoroacetic acid 

Tis Trisisopropylsilane 
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Troc 



Trt 



Tmob 



TMOF 



2, 4, 6-trimehoxybenzyl 
trimethylorthoformate 
2,2,2Trichloroethylcarbamate 
triphenylmethyl 



5 



Example 1 : General Materials and Methods 

Peptide Synthesis 

Water-soluble thioester peptides are synthesized on a thioester-generating 
resin described in Example 2, whereas standard peptide thioesters are 

10 synthesized on a standard thioester generating resin (Hackeng, et al., PNAS 
(1999) 96:10068-10073). Both the water-soluble and standard thioester peptides 
are synthesized following the in situ neutralization protocol for Boc 
(tertbutoxycarbonyl) chemistry and stepwise solid phase peptide synthesis (SPPS) 
using established SPPS, side-chain protection and thioester-resin strategies 

15 (Hackeng, et al., PNAS (1999) 96: 10068-10073; and Schnolzer, et al M Int. J. Pept. 
Prot. Res., (1992) 40: 180-193)) on an ABI433A automated peptide synthesizer or 
by manual chain assembly, or ordered and acquired from commercial vendors. 
For instance, a standard set of Boc SPPS protecting groups are used, namely: 
Arg(Tos); Asp(cHex); Cys(4MeBzl) & Cys(Acm); Glu(cHex); His(DNP); Lys(CIZ); 

20 Ser(Bzl); Thr(Bzl); Trp(formyl); Tyr(BrZ); Met, Asn, Gin are side-chain unprotected. 
Non-thioester peptides are synthesized on a -OCH 2 -PAM-resin. Peptides are 
deprotected and simultaneously cleaved from the resin support using HF/p-cresol 
according to standard Boc chemistry procedure. The peptides are purified by 
preparative C4 reversed-phase-high pressure liquid chromatography (RP-HPLC). 

25 Fractions containing pure peptide are identified using ES-MS (electrospray 
ionization mass spectrometry), pooled and lyophilized for subsequent ligation. For 
peptides containing protecting groups not removed in HF/p-cresol, the protecting 
groups are retained, for instance, such as Acm-protected cysteines, and DNP- 
protected histidines. 



Acm and DNP Protecting Group Removal 

DNP-removal from histidines occurs under the nucleophilic conditions 
employed for native chemical ligation. For Acm-protected peptides, Acm-removal 
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is performed following standard mercury-acetic acid conditions by dissolving the 
peptide in TFE at a concentration of 10-70 mg/mL Alternatively, t he reaction is 
performed in HPLC buffer. The solution is diluted with 4M freshly prepared urea, 
for a final concentration of peptide of 4 mg/ml and TFE of 20%. A threefold molar 
5 excess (relative to the total expected cysteine concentration) of a 30 mg/ml 
Hg(acetate) 2 solution in 3% aqueous acetic acid is added and the solution is stirred 
for one hour. The solution is then made 20% in p-mercaptoethanol, and 60% 100 
mM acetate, pH4 containing 6M guanidinium chloride. The solution is loaded onto 
C4 coated reversed-phase beads and desalted by washing with low organic buffer 
10 (15% aqueous isopropanol, 0.1% TFA) followed by elution with high-organic buffer 
(90% aqueous isopropanol, 0.1% TFA). Fractions containing the desired product 
are identified by ES-MS and pooled and lyophilized. 

Native Chemical Ligation 

15 Native chemical ligation is carried out by admixing a first peptide bearing an 

N-terminal cysteine with a 1.5 fold excess of a second peptide (the peptide- 
thioester segment), and dissolved in 200 mM phosphate buffer (pH 7.9) containing 
6 M guanidinium chloride at a concentration of 2-4 mM concentration and 1% 
thiophenol is added. After ligation, 1 equivalent (eq) v/v TFE (trifluoroethanol), 2 

20 eq v/v 6M guanidinium chloride, 100 mM Tris-HCI, pH 8.5 and 1eq v/v p- 
mercaptoethanol is added to the ligation mix and incubated for 30 minutes. The 
solution is acidified with a solution of 15 mg/ml TCEP (tris(2- 
carboxyethyl)phosphine.HCI) in glacial acetic acid and loaded onto a preparative 
C4 reversed-phase HPLC column (1 inch diameter). The peptides are then 

25 purified by preparative gradient reversed-phase HPLC. Fractions containing the 
desired ligated product are identified by ES-MS and pooled. 

Example 2: Synthesis of Water-Soluble Polymer Thioester-Generating 
Resin 

30 The following Boc-protected amino acid (aa x )-thioester water-soluble 

polymer generating resin shown below is prepared for synthesis of water-soluble 
peptide thioesters: 
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where R' is the side chain of the first amino acid for the Boc-SPPS of a 
5 target peptide, R 7 and Ra are each individually hydrogen, -CH 3 , or-CH(CH 3 )2, and 
where n is a discrete integer from 2 to 12. 

On a 0.5mM scale 400 mg succinic anhydride is dissolved in 8m! of 0.5M 
HOBT (N-hydroxybenzotriazole) containing 400ul DIEA is coupled to NH2-Leu- 
PAM-resin (substitution -0.5 mmol/g) for 10 minutes. The resin is activated with 

10 8ml of fresh 1 M CDI (Carboxydiimidazole) solution in DMF and 4mL (4,7,10)- 
trioxatridecane-1,13diamine (TTD) is added in 4mL 0.5M HOBT solution coupled 
for 30 minutes. After n cycles of this process, and depending on the target peptide 
sequence, different starting resins are constructed to provide the first amino acid 
for Boc-SPPS in the target sequence by one of two protocols. 

15 In one protocol, a selected carboxylic acid, such as acylmercaptoacetic 

acid, is activated in DCC (1 ,3-dicyclohexylcarboiimide) with PfpOH 
(pentafluorophenyl ester) to give S-Acetylthioglycolic acid pentafluerophenyl ester 
(SAMA-Opfp) and coupled to the resin following standard DCC/PfpOH activation 
and coupling protocols (e.g., "Fmoc Solid Phase Peptide Synthesis, A Practical 

20 Approach," Eds., W.C. Chang and P.D. White, Oxford University Press, 2000). 
This is followed by removal of the acyl moiety by a piperdine cocktail containing a 
thiol or selenol, such as piperdine and p-mercaptoethanol in DMF, and then 
followed by coupling of a selected Boc-protected amino acid (typically 2.5 mmole) 
dissolved and activated in 4.75 ml 0.5 M (2.375 mmole) HBTU in DMF containing 

25 150 microliter (111 mg, 0.825 mmole) DIEA, typically for 1 hour with vortex 
agitation, followed by draining. 

In a second protocol, a selected preformed Boc-amino acid thioester (e.g., 
Boc-protected amino acid containing a thioester) (typically 0.5 mmol) is suspended 
in DCM (1mL) and DIEA (1.5 mmol) added. The solution is vortexed at room 

30 temperature for 2 m in and added to the resin. Solid HATU (0.5 mmol) is then 
added directly to the resin-mixture, mixed and stirred occasionally for 30 min. 
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For both protocols, the resin is then drained, and washed with DCM, DMF, 
DCM, and then dried in vacuo for 1 h to constant weight. 

Example 3: Synthesis of Peptide-GRFN 1712 

5 Peptides GRFN-1712 and GRFN-1 71 2-PLP 3 having the sequences 

depicted below are synthesized in accordance with Example 1 on a 0.15 mmol 
scale using HBTU coupling chemistry and Boc-SPPS. For GRFN-1 71 2-PLP 3 -Leu 
the following Boc-tryptophan(formyI)-thioester-PLP3 generating resin is used: 



o 




Amino acids free of ethyl acetate are used as precursor materials. The 
resulting peptide-thioesters are deprotected and simultaneously cleaved free of the 
supporting resin and linker in HF/p-cresol (9 to 1 ratio) at 0°C for one hour. 

15 GRFN-1712 

CSLNE KITVP DTKVN FYAWK RMEVG QQAVE VWQGL 
ALLSE AVLRG QALLV KSSQP W-thioester (where Cys 1 is 
Acm protected) (SEQ ID NO:1) 

20 GRFN 1712-PLP 3 

CSLNE KITVP DTKVN FYAWK RMEVG QQAVE VWQGL 
ALLSE AVLRG QALLV KSSQP W-thioester-PLP 3 -L (where 
Cys 1 is Acm protected) (SEQ ID NO:2) 

25 Example 4: Handling Properties of Peptide-GRFN 1712 

The deprotected and cleaved GRFN 1712-thioester-PLP 3 -Leu prepared in 
accordance with Example 3 is completely dissolved in 50% aqueous acetonitrile 
containing 0.08% trifluoroacetic acid (TFA). Analysis of the crude cleavage 
product for the 1712-thioester-PLP 3 by HPLC and Electrospray Mass 
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Spectroscopy (ES-MS) shows it to be the major product, including a very sharp 
and predominant peak by HPLC analysis for the desired product. In comparison, 
the standard 1712 peptide-thioester (i.e., without the water-soluble polymer) 
prepared in accordance with Example 3 requires TFE and TFA for dissolution; 
5 also, HPLC and ES-MS analyses showed a complex heterogenous crude cleavage 
product. 

Relative solubility of the 1712-PLP3-Leu thioester and the standard 1712 
thioester also are tested in 50% ACN/H 2 0. The 1712-PLP 3 -Leu thioester (10 
mg/mL) is completely dissolved, whereas the standard 1712 thioester is only 
1 0 partially soluble at low concentration. 

Example 5: Synthesis of Peptide-GRFN 1 852-PLP 3 -Leu 

The following thioester peptides having the amino acid sequences shown 
below are synthesized in accordance with Example 1 on a 0.15 mmol scale using 
15 HBTU coupling chemistry and Boc-SPPS. For GRFN 1852-PLP 3 -Leu, the 
following Boc-leucine-thioester-PLP 3 -Leu generating resin is used: 




20 

Amino acids free of ethyl acetate are used as precursor materials. The 
resulting peptide-thioesters are deprotected and simultaneously cleaved free of the 
supporting resin and linker in HF/p-cresol (9 to 1 ratio) at 0°C for one hour. 

25 GRFN 1852 

CLSQL HSGLF LYQGL LQALE GISPE LGPTL DTLQL DVADF 
ATTIW QQMEE L-thioester (SEQ ID NO:3) (where Cys 1 is Acm 
protected) 
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GRFN 1852-PLP 3 

CLSQL HSGLF LYQGL LQALE GISPE LGPTL DTLQL DVADF 
ATTIW QQMEE L-thioester-PLP 3 -Leu (SEQ ID NO:4) (where 
Cys 1 is Acm protected) 

5 

Example 6: Handling Properties of Peptide-GRFN 1 852 

As with GRFN peptide 1712, the deprotected and cleaved GRFN 1852- 
PLP3-l_eu containing a thioester is completely dissolved in 50% aqueous 
acetonitrile containing 0.08% trifluoroacetic acid (TFA). In contrast, the non- 
10 polymer modified peptide GRFN 1852 is poorly soluble. In addition, when 
comparing the purity of the crude GRFN 1852-PLP 3 thioester cleavage product as 
compared to that of the standard GRFN 1852 (i.e., without the water-soluble 
polymer) by ES-MS or HPLC, the water-soluble polymer thioester peptide is clearly 
better (See, e.g., the HPLC chromatogram illustrated in Figures 3A-3B). As 
15 illustrated in Figure 3B, the major peak is the target product. 

The above results and discussion demonstrate that the subject methods 
provides improved methods of SPPS, particularly in the SPPS generation of 
thioester and selenoester compounds, where advantages of the invention include 
20 improved solubility and/or reduced aggregation of crude peptide products, leading 
to better purification, higher yield, and other advantages. As such, the subject 
methods represent a significant contribution to the art. 

All publications and patent application cited in this specification are herein 
25 incorporated by reference as if each individual publication or patent application 
were specifically and individually indicated to be incorporated by reference. The 
citation of any publication is for its disclosure prior to the filing date and should not 
be construed as an admission that the present invention is not entitled to antedate 
such publication by virtue of prior invention. 
30 Although the foregoing invention has been described in some detail by way 

of illustration and example for purposes of clarity of understanding, it is readily 
apparent to those of ordinary skill in the art in light of the teachings of this invention 
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that certain changes and modifications may be made thereto without departing 
from the spirit or scope of the appended claims. 
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